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1.  Introduction. 


Decision  trees  are  often  used  to  model  algorithms  for  combinatorial  and 
geometrical  problems.  While  motivation  for  these  models  rests  primarily  on  their 
generality  and  conceptual  simplicity,  they  also  have  the  benefit  of  offering  at 
present  the  most  promising  prospect  for  proving  worst  case  lower  bounds  in  many 
problems. 

For  linear  decision  trees  several  powerful  techniques  are  known  for  bounding 
the  tree  height  from  below,  e.g.  Reingold  (9],  Dobkin  [3],  Dobkin  and  Lipton 
[4][5],  Yao  {13],  and  Yao  and  Rivest  {15]. 

Much  less  is  known  for  general  algebraic  decision  trees.  Beyond  the  naive  in¬ 
formation  bound,  Rabin’s  theorem  (Rabin  [8])  and  the  convex  hull  problem  (Yao 
[14])  are  apparently  the  only  known  results. 

The  purpose  of  this  article  is  to  provide  a  general  method  for  establishing 
lower  bounds  for  the  worst  case  performance  of  algorithms  prescribed  by  arbitrary 
algebraic  decision  trees.  Technically  this  work  extends  the  results  of  Dobkin  and 
Lipton  [4]  [5],  but  the  tools  put  to  work  here  provide  non-trivial  bounds  for  a  large 
class  of  previously  untouchable  problems. 

Before  giving  the  detailed  computational  model  it  seems  worthwhile  to  men¬ 
tion  informally  a  concrete  application. 


Theorem  1.  Any  algebraic  decision  tree  of  bounded  order  which  solves  the  n- 
dimensional  knapsack  problem  must  have  height  at  least  0(na). 


This  result  extends  the  knapsack  bounds  under  the  linear  decision  tree  model 
due  to  Dobkin  and  Lipton  [4]  and  the  ft(nlogn)  result  of  Dobkin  [3]. 

The  method  used  here  rests  critically  on  a  result  from  real  algebraic  geometry 
due  to  Milnor  [7].  Since  the  machinery  used  by  Milnor  may  not  be  familiar  to 
workers  in  complexity,  we  have  tried  to  give  an  expository  of  the  basic  facts 
necessary  for  making  this  work  self-contained.  The  bounds  discussed  here  should 
prove  useful  in  many  related  problems. 

In  the  next  section  we  rigorously  specify  the  computational  model  and  outline 
the  lower  bound  method.  The  third  section  exposits  Milnor's  inequality  and  gives  a 
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heuristic  argument  which  tries  to  pinpoint  the  necessity  for  the  more  sophisticated 
tools. 

The  fourth  section  is  devoted  to  applications  and  in  particular  to  the  proof 
of  the  result  on  the  knapsack  problem  (Theorem  1)  which  was  mentioned  above. 

The  final  section  mentions  some  open  problems  and  suggest  a  line  of  attack 
which  if  sufficiently  developed  might  add  significantly  to  the  power  of  the  present 
method. 
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2.  Computational  Model  tbe  General  Method. 

Let  W  C  be  any  set.  A  [d-tb  order)  deciiioo  tree  T  for  testing  if 
2  6  W  is  a  ternary  tree  with  each  internal  node  containing  a  test  of  the  form 
p(z i*  *2i  0,  where  p  is  a  polynomial  of  degree  at  most  d.  Each  leaf  of  T 

contains  "yes”  or  a  "no”  answer.  For  an  input  2,  the  procedure  starts  at  the  root 
and  traverses  down  the  tree.  At  each  internal  node  a  branching  is  made  according 
to  the  polynomial  test  at  that  node  and  when  a  leaf  is  reached  the  answer  to  the 
question  "Is  2  €  W”  must  be  given  correctly. 

Now  let  Cj(W)  be  tbe  minimum  height  hT  for  any  rf-th  order  decision  tree 
T  (for  the  set  W).  Our  key  objective  will  be  to  obtain  lower  bounds  on  C*(W), 
and  the  bound  given  here  will  depend  heavily  on  the  topology  of  W. 

By  we  denote  tbe  number  of  (disjoint)  connected  components  of  W.  Also 
for  any  polynomial  p{z i,za,..  .,x»)  we  set  Sp  =  {2  |  p(2)  ^  0),  and  for  any 
integers  n,  m  >  0  we  put  p{m,n)  =  max{  *  Sp  j  p  is  a  polynomial  of  n  real 
variables  and  of  degree  at  most  m}. 

The  following  elementary  result  provides  the  skeleton  of  our  method.  (To 
put  flesh  on  the  bones  will  require  the  bounds  on  p  obtained  in  the  next  section.) 

Theorem  2.  Let  W  C  be  an  open  set ,  and  let  T  be  a  d-tb  order  algebraic 
decision  tree  for  deciding  if  2  6  IV.  If  W  is  the  disjoint  union  of  N  open  sets , 
then  the  height  hr  satisfies  the  inequality 

2KTp(hrd,n )  >  N. 


Proof.  For  each  leaf  l  of  T  let  Vi  be  the  set  of  inputs  2  €  R*  leading  to  l 
and  let  It  be  the  set  of  constraints  resulting  from  the  tests.  Let  L  be  the  set  of 
leaves  l  such  that  It  consists  only  of  strict  inequalities  and  such  that  the  answer 
stored  at  l  is  "yes”.  One  should  note  that  each  V*  is  an  open  set  and  V*  C  W. 

We  now  write  W  —  W,  where  each  W,  is  a  connected  open  set  and 
the  W{  are  disjoint,  and  write  V*  =  (2:  p<t(2)  <  0, Plt  <  0, ...,p{((2)  <  0} 
where  each  ptl  is  a  polynomial  of  degree  not  greater  than  d  and  where  a  < 
hr •  As  a  consequence  of  this  representation,  V/  C  { 2  |  qt(l)  0}  =  D 


where  qi{t)  —  I|Py(I)  is  a  polynomial  of  degree  at  most  hrd.  Moreover,  each 
connected  component  of  V*  is  contained  in  at  most  one  component  of  D.  Hence, 
Vi  has  at  most  0(hTd,n )  connected  components  V«,  V&,  ...  . 

Since  each  leaf  of  T  is  correctly  labeled,  each  Vy  has  to  be  completely 
contained  in  some  W,.  Since  the  number  of  such  Vy  is  at  most  0(hrd,n)  and 
there  are  only  |£|  values  of  l  which  lead  to  “ye s"  the  number  of  components  N 
of  W  is  bounded  by  \L\P(hrd,n).  Since  2Kr  >  )£,)  the  theorem  follows.  | 


3.  Counting  Connected  Component!. 

To  use  Theorem  2  one  needs  bounds  on  /?(m,  n)  and  this  is  apparently  no 
easy  matter.  Fortunately,  there  is  a  bound  due  to  Milnor  [7]  which  is  sufficient 
for  some  applications: 


0(m,  n)  <  (m  +  2)(m  +  l)n  1 .  (3.1) 

The  proof  of  Milnor’s  inequality  rests  on  the  several  substantial  results  from  Morse 
theory  and  algebraic  topology,  but  it  is  nevertheless  possible  to  give  a  heurtistic 
indication  of  an  analogous  result. 

The  only  preliminary  needed  for  the  argument  is  Bezout’s  Theorem  which 
says  that  any  system  of  n  algebraic  equations  in  n  variable  with  degree  d  has 
either  infinitely  many  (complex)  solutions  or  at  most  dn.  For  a  classical  approach 
to  the  proof  of  Bezout’s  Theorem  one  can  consult  Enriques  [6],  or,  for  the  case 
n  =  2,  there  is  a  nice  proof  in  Seidenberg  [10]. 

To  use  Bezout’s  Theorem  we  suppose  that  p  is  a  real  polynomial  in  n  variables 
with  degree  m,  and  we  note  that  R  can  be  chosen  so  that  A  —  {p  >  0}  f)(9  = 
R2  —  i  >  0}  has  as  many  bounded  connected  components  as  {p  >  0}  has 
connected  components  (bounded  or  unbounded).  Since  each  bounded  connected 
component  of  A  must  contain  a  local  maximum  of  pq,  the  number  of  bounded 
components  of  A  is  majorized  by  the  number  of  zeros  of  the  system  Vpq  =  0. 
By  Bezout,  this  number  is  either  infinite,  or  else  bounded  by  (m  -f-  l)n. 

This  finite  bound  is  for  our  purposes  almost  as  sharp  as  Milnor’s  bound.  The 
real  work  comes  in  providing  a  rigorous  perturbation  argument  which  rules  out  the 
case  when  Bezout  gives  only  the  trivial  infinite  bound.  That  is  precisely  the  case 
which  causes  all  the  trouble  and  presents  this  section  from  being  self  contained. 

We  should  further  remark  that  a  recent  exposition  of  R.  Bott  [2]  provides  an 
intuitive  introduction  to  Milnor  [7],  where  inequality  (3.1)  is  given  as  Theorem  3. 
As  it  happens,  the  problem  of  determining  f}(m,  n)  is  actually  very  deep  and  it  is 
intimately  connected  with  Hilbert’s  16-th  Problem,  see  Arnold  [1]. 
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4.  Applications. 

We  now  use  Theorem  2  to  derive  lower  bounds.  Clearly,  the  function  2 */?(zd,  n) 
is  an  increasing  function  of  x.  Let  a{d,n,N)  be  the  minimum  z  satisfying 
2 */?(xd,n)  >  N .  Theorem  2  immediately  yields  the  following  formal  bounds: 

Any  general  upper  bounds  on  P  can  be  used  to  derive  lower  bounds  on  a  and 
hence  Cd.  In  particular,  Milnor’s  bound  (3.1)  gives  the  following  result. 

Theorem  3.  For  any  real  e, 

CA{W)  >  min{e log2  N ,  -  1)} 

a 

when  N  =  *W. 


Corollary.  If  #W  =  n (n<‘+4)n)  tor  some  fixed  6  >  0,  then 

Cd(W)  =  n(log(#W)). 

Proof.  Let  x  =  Q(W).  Then  2 *0(xd,n)  >  AT.  Hence  by  (3.1) 

2*(xd+l)n  >  N. 

Either  2*  >  AP  or  (xd  +  l)n  >  N l'~t,  proving  the  theorem.  | 

The  corollary  follows  by  writing  =  »n(l +»('*))  and  setting  t  =  $  i+#(n) 

in  the  theorem. 

Thus,  Theorem  3  gives  a  lower  bound  nonlinear  in  n  when  *W  grows  at  least 
as  fast  as  n(>+<)n.  This  is  also  necessary  since  the  theorem  only  gives  a  lower 
bound  O(n)  when  #W  =  0(rtn). 

In  the  first  example  given  below,  *  W  «  2^n*  thus  we  have  a  good  lower 
bound.  The  other  two  examples  have  *W  <  r»n,  and  Theorem  3  does  not  give 


1 


I 


nonlinear  bounds.  However,  Theorem  2  (or,  (4.1))  is  still  true  for  these  later 
examples,  and  a  better  determination  in  the  future  may  result  in  an  improved 
bound. 

Example  1.  The  Knapsack  Problem.  Given  real  numbers  Xj,  z2,  •  •  • , *»,  decide  if 
there  exists  some  subset  S  C  {1, 2, . . . ,  n}  such  that  £ieS  z,  =  1. 

In  this  case,  W  =  {(zuz2l. . .  ,zn)  |  Ils(E,es  z»  —  1)  5*  0)  •  It  was  shown 
in  Dobkin’s  and  Lipton  (1978)  >  2»n*.  Thus,  Ci(W)  =  Q(n2)  for  any  fixed 

d.  This  generalizes  the  result  of  Dobkin  and  Lipton  where  they  showed  Ci{W)  = 

n(»2). 

Example  2.  Element  Distinctness.  Given  Zi,z2,...,zn  €  Z,  is  there  a  pair  j 
with  *  j  and  z,  =  z;?  In  this  case, 

W'={(z1,z2,...,zn)|  n(z.-z,)^0}C 

•y; 

.  It  is  easily  shown  that  *W  —  n!  since  each  region  {(zj ,  z2,...,zn)  |  z<t{\)  < 
x0( 2)  <  . . .  z„(n) }  is  a  maxima]  connected  component  of  W  for  each  permutation 
a.  One  therefore  has  Ci{W)  >  a{d,n,n\). 

Example  3.  Extreme  Points.  Given  n  points  on  the  plane  does  the  convex  hull 
formed  by  them  possess  n  vertices? 

Here  W  cannot  be  expressed  by  an  easy  algebraic  relation  but  it  is  still 
possible  to  show  >  (n  —  1)!.  Obviously,  W  is  an  open  set  in  (£2)n.  For 
any  configuration  {zj,z2,...,zn}  in  W  C  (£2)n,  we  have  a  cyclical  ordering  a 
of  the  points  {  x,-  |  1  <  t  <  n  }  which  is  given  uniquely  by  taking  the  points  in 
cyclical  order.  Clearly,  any  of  the  (n  —  1)!  cyclical  permutations  can  arise  in  this 
way  so  all  that  remains  is  to  show  that  if  a  a1  then  the  configurations  which 
give  rise  to  these  permutations  are  in  disjoint  components  of  W. 

For  each  configuration  in  W  we  consider  the  (£)  element  array  A  given 
by  A(Z'ZjZ*)  where  A  is  the  signed  area  of  the  triangle  formed  by  the  3*set 
{ZiZjZ/t}  C  {zj ,  z2, . . . ,  zn}.  If  the  configuration  corresponding  to  a  is  con¬ 
tinuously  deformed  in  any  way  to  the  configuration  for  o'  then  Ac  is  transformed 
continuously  into  Aai.  Since  a  and  o'  differ  there  is  some  triple  {z,z;z*}  for 
which  A(Z|Zyz*)  has  differing  signs  in  Ac  and  Ac<.  By  the  intermediate  value 
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theorem  there  is  therefore  some  time  during  the  continuous  deformation  when 
A (ziXjZk)  =  0.  This  says  that  x,,  Xj,  z*  are  then  co-linear  and  at  that  point 
there  are  at  most  n  —  1  extreme  points  in  the  configuration.  This  proves  that 
any  passage  from  a  to  o*  must  go  out  of  W,  so  o  and  o*  correspond  to  different 
components. 

The  main  consequence  of  the  preceding  bound  is  that 

Cd(W)>a[d,2n,(n- 1)!) 

and  it  was  originally  hoped  that  this  would  be  sufficient  to  prove  a  conjecture  of 
Yao  [14]  that  any  algebraic  decision  tree  of  order  d  for  the  extreme  point  problem 
must  have  height  O(nlogn).  The  Milnor  bound  in  this  case  is  not  sufficiently 
sharp  to  obtain  the  desired  bound.  We  indicate  in  the  next  section  a  bound  which 
would  be  sufficient. 

While  these  last  two  examples  are  disappointing  in  that  they  do  not  give  the 
conjectured  non-linear  lower-bounds,  one  should  note  that  since  only  a  yes-no 
answer  is  required  there  is  a  logical  necessity  of  only  2  terminal  leaves.  So,  the 
information  theoretic  bound  in  these  two  cases  gives  only  the  absurd  bound  log2  2. 


5.  Open  Problems  and  Directions. 

Surely  the  most  interesting  and  important  problems  pivot  about  finding 
sharper  bounds  on  p(m,n).  It  is  conceivable  that  p{m,n)  =  which 

could  imply  by  Theorem  2  that  Ci{W)  =  0(^(log2  N  —  m)).  This  bound  would 
yield  a  O(nlogn)  lower  bound  in  Examples  2  and  3  for  fixed  d. 

In  fact,  a  somewhat  weaker  result  will  suffice  for  this  purpose.  Let  p(d,  m',  n) 
be  the  maximum  of  # Sp  for  any  p  of  the  form  FlfLi  P»(*ii  z2i  •  •  •  >  z»)»  with  each 
Pi  of  degree  not  greater  than  d.  Clearly  p(d,m',n)  <  /3(dm',n).  The  result  one 
really  needs  in  Examples  2  and  3  is  (3(d,m!,n )  =  Can  one  prove 

better  bounds  on  fl(d,  m',n)  than  on  /?(m,  n)?  Here  we  note  that  it  is  not  hard 
to  see  that 

P{  1,  n)  ^  S  (^  ) '  n<m!  (5.1) 

since  p(  1,  m*,  n)  just  equals  the  number  of  regions  of  R,n  which  can  be  partitioned 
by  m'  hyperplanes.  (This  is  proved  in  Steiner  (1826)  (11]  which  is  in  the  first 
volumn  of  Crelle’s  J.  Reine  Ang.  Mat h.  and  which  is  better  remembered  for 
containing  five  fundamental  papers  of  N.  H.  Abel.  For  modern  treatment  of  (5.1) 
see  Wetzel  (12]  and  the  refrcnces  given  there.) 

A  more  modest  approach  to  the  problems  suggested  by  Examples  2  and  3 
rest  on  obtaining  bounds  for  any  small  values  of  d  >  2.  It  is  known  (Yao  (14]) 
that  C2(W)  =  fl(nlogn)  in  Example  3,  but  there  are  no  other  known  non-linear 
lower  bounds  even  in  the  case  d  =  3. 
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